56 research outputs found
A method for extracting travel patterns using data polishing
With recent developments in ICT, the interest in using large amounts of accumulated data for traffic policy planning has increased significantly. In recent years, data polishing has been proposed as a new method of big data analysis. Data polishing is a graphical clustering method, which can be used to extract patterns that are similar or related to each other by identifying the cluster structures present in the data. The purpose of this study is to identify the travel patterns of railway passengers by applying data polishing to smart card data collected in the Kagawa Prefecture, Japan. To this end, we consider 9,008,709 data points collected over a period of 15 months, ranging from December 1st, 2013 to February 28th, 2015. This dataset includes various types of information, including trip histories and types of passengers. This study implements data polishing to cluster 4,667,520 combinations of information regarding individual rides in terms of the day of the week, the time of the day, passenger types, and origin and destination stations. Via the analysis, 127 characteristic travel patterns are identified in aggregate
Asymptotic properties of parametric and nonparametric probability density estimators of sample maximum
Asymptotic properties of three estimators of probability density function of
sample maximum are derived, where is a function of
sample size . One of the estimators is the parametrically fitted by the
approximating generalized extreme value density function. However, the
parametric fitting is misspecified in finite cases. The misspecification
comes from mainly the following two: the difference and the selected block
size , and the poor approximation to the generalized extreme value
density which depends on the magnitude of and the extreme index .
The convergence rate of the approximation gets slower as tends to
zero. As alternatives two nonparametric density estimators are proposed which
are free from the misspecification. The first is a plug-in type of kernel
density estimator and the second is a block-maxima-based kernel density
estimator. Theoretical study clarifies the asymptotic convergence rate of the
plug-in type estimator is faster than the block-maxima-based estimator when
. A numerical comparative study on the bandwidth selection shows
the performances of a plug-in approach and cross-validation approach depend on
and are totally comparable. Numerical study demonstrates that the
plug-in nonparametric estimator with the estimated bandwidth by either approach
overtakes the parametrically fitting estimator especially for distributions
with close to zero as gets large
A semiparametric probability distribution estimator of sample maximums
Several approaches of nonparametric inference for extreme values have been
studied. This study surveys the semiparametric probability distribution
estimation of sample maximums. Moriyama (2021) clarified that the parametric
fitting to the generalized extreme value distribution becomes large as the tail
becomes light, which means the convergence becomes slow. Moriyama (2021)
proposed a nonparametric distribution estimator without the fitting of the
distribution and obtained asymptotic properties. The nonparametric estimator
was proved to outperform the parametrically fitting estimator for light-tailed
data. Moreover, it was demonstrated that the parametric fitting estimator
numerically outperformed the nonparametric one in other cases.
Motivated by the study, we construct two types of semiparametric distribution
estimators of sample maximums. The proposed distribution estimators are
constructed by mixing the two distribution estimators presented in Moriyama
(2021). The cross-validation method and the maximum-likelihood method are
presented as a way of estimating the optimal mixing ratio. Simulation
experiments clarify the numerical properties of the two types of semiparametric
distribution estimators
Recommended from our members
NEW CONSTRUCTION OF THE SESERAGI FISHWAY IN MIYANAKA INTAKE DAM AND ADAPTIVE MANAGEMENT
Follow-Up of Patients Who Achieved Sustained Virologic Response after Interferon-Free Treatment against Hepatitis C Virus: Focus on Older Patients
Background and Objectives: Direct-acting antiviral agents (DAAs) have improved sustained virologic response (SVR) rates in patients with chronic hepatitis C virus (HCV) infection. Our aim was to elucidate the occurrence of hepatocellular carcinoma (HCC) and to compare the outcomes of patients aged 75 years or older (older group) with those of patients younger than 75 years (younger group) after SVR. Materials and Methods: Among 441 patients treated with interferon-free DAA combinations, a total of 409 SVR patients were analyzed. We compared the two age groups in terms of HCC incidence and mortality rates. Results: Older and younger groups consisted of 68 and 341 patients, respectively. Occurrence of HCC after SVR did not differ between the two groups of patients with a history of HCC. Occurrence of HCC after SVR was observed more in younger patients without a history of HCC (p < 0.01). Although older patients without a history of HCC had a higher mortality rate (p < 0.01), their causes of death were not associated with liver diseases. Among younger patients without a history of HCC, none died. Conclusions: After SVR, liver disease may not be a prognostic factor in older HCV patients without a history of HCC
- …